1 AACAGCACGA GGGCGAGGGG ACGTCTCCTC TCCCCCAGCT GCTCTGCTCG 

51 GATGGCGCCG CCGGCTGAGT GACGGGGGCG GCGCGCAGGA CTTCCCAGCT 

101 CGGACCTCTT GCCTTCGAGG GGAAAGATGT ACGAGAGTGT AGAAGTGGGG 

151 GGTCCCACCC CTAATCCCTT CCTAGTGGTG GATTTTTATA ACCAGAACCG 

201 GGCCTGTTTG CTCCCAGAGA AGGGGCTCCC CGCCCCGGGT CCGTACTCCA 

251 CCCCGCTCCG GACTCCGCTT TGGAATGGCT CAAACCACTC CATTGAGACC 

301 CAGAGCAGCA GTTCTGAAGA GATAGTGCCC AGCCCTCCCT CGCCACCCCC 

351 TCTACCCCGC ATCTACAAGC CTTGCTTTGT CTGTCAGGAC AAGTCCTCAG 

4 01 GCTACCACTA TGGGGTCAGC GCCTGTGAGG GCTGCAAGGG CTTCTTCCGC 

451 CGCAGCATCC AGAAGAACAT GGTGTACACG TGTCACCGGG ACAAGAACTG 

501 CATCATCAAC AAGGTGACCC GGAACCCCTG CCAGTACTGC CGACTGCAGA 

551 AGTGCTTTGA AGTGGGCATG TCCAAGGAGT CTGTGAGAAA CGACCGAAAC 

601 AAGAAGAAGA AGGAGGTGCC CAAGCCCGAG TGCTCTGAGA GCTACACGCT 

651 GACGCCGGAG GTGGGGGAGC TCATTGAGAA GGTGCGCAAA GCGCACCAGG 

701 AAACCTTCCC TGCCCTCTGC CAGCTGGGCA AATACACTAC GAACAACAGC 

751 TCAGAACAAC GTGTCTCTCT GGACATTGAC CTCTGGGACA AGTTCAGTGA 

801 ACTCTCCACC AAGTGCATCA TTAAGACTGT GGAGTTCGCC AAGCAGCTGC 

851 CCGGCTTCAC CACCCTCACC ATCGCCGACC AGATCACCCT CCTCAAGGCT 

901 GCCTGCCTGG ACATCCTGAT CCTGCGGATC TGCACGCGGT ACACGCCCGA 

951 GCAGGACACC ATGACCTTCT CGGACGGGCT GACCCTGAAC CGGACCCAGA 

1001 TGCACAACGC TGGCTTCGGC CCCCTCACCG ACCTGGTCTT TGCCTTCGCC 

1051 AACCAGCTGC TGCCCCTGGA GATGGATGAT GCGGAGACGG GGCTGCTCAG 

1101 CGCCATCTGC CTCATCTGCG GAGACCGCCA GGACCTGGAG CAGCCGGACC 

1151 GGGTGGACAT GCTGCAGGAG CCGCTGCTGG AGGCGCTAAA GGTCTACGTG 

1201 CGGAAGCGGA GGCCCAGCCG CCCCCACATG TTCCCCAAGA TGCTAATGAA 

1251 GATTACTGAC CTGCGAAGCA TCAGCGCCAA GGGGGCTGAG CGGGTGATCA 

1301 CGCTGAAGAT GGAGATCCCG GGCTCCATGC CGCCTCTCAT CCAGGAAATG 

1351 TTGGAGAACT CAGAGGGCCT GGACACTCTG AGCGGACAGC CGGGGGGTGG 

1401 GGGGCGGGAC GGGGGTGGCC TGCCCCCCCC GCCAGGCAGC TGTAGCCCCA 

1451 GCCTCAGCCC CAGCTCCAAC AGAAGCAGCC CGGCCACCCA CTCCCCGTGA 

1501 CCGCCCACGC CACATGGACA CAGCCCTCGC CCTCCGCCCC GGCTTTTCTC 

1551 TGCCTTTCTA CCGACCATGT GACCCCGCAC CAGCCCTGCC CCCACCTGCC 

1601 CTCCCGGGCA GTACTGGGGA CCTTCCCTGG GGGACGGGGA GGGAGGAGGC 

1651 AGCGACTCCT TGGACAGAGG CCTGGGCCCT CAGTGGACTG CCTGCTCCCA 

1701 CAGCCTGGGC TGACGTCAGA GGCCGAGGCC AGGAACTGAG TGAGGCCCCT 

1751 GGTCCTGGGT CTCAGGATGG GTCCTGGGGG CCTCGTGTTC ATCAAGACAC 

1801 CCCTCTGCCC AGCTCACCAC ATCTTCATCA CCAGCAAACG CCAGGACTTG 

1851 GCTCCCCCAT CCTCAGAACT CACAAGCCAT TGCTCCCCAG CTGGGGAACC 

1901 TCAACCTCCC CCCTGCCTCG GTTGGTGACA GAGGGGGTGG GACAGGGGCG 

1951 GGGGGTTCCC CCTGTACATA CCCTGCCATA CCAACCCCAG GTATTAATTC 

2001 TCGCTGGTTT TGTTTTTATT TTAATTTTTT TGTTTTGATT TTTTTAATAA 

2051 GAATTTTCAT TTTAAGCACA AAAAAAAAAA AAAAAA 



FEATURES : 

Start codon: 127 
Stop codon: 14 98 
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Homologous proteins : 

Top 10 BLAST Hits 



gil 

gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 
gil 



3213188|gb|AAC23439.1| (U15211) retinoic acid receptor alpha. 
4506419 I re f | NP_000955 . 1 1 retinoic acid receptor, alpha >gi 1 1 . 
35874 |emb|CAA29787.1! (X06538) retinoic acid receptor (AA 1 . 
7 638251 1 gb 1 AAF65452 . 1 1 (AF242867) ECFP-retinoic acid recepto., 
7441783 I pir | | S78481 retinoic acid receptor alpha-2 - eastern. 

1| (AF088895) retinoic acid receptor alp. 
1| (U41742) nucleophosmin-retinoic acid . 
2| (S50916) retinoic acid receptor alpha. 
1| (U41743) nucleophosmin-retinoic acid . 
(M73779) PML-RAR protein [Homo sapiens] 



4160009|gb|AAD05222. 
1314308|gb|AAB00112. 
8815561 |gb|AAB19602. 
1314310|gb|AAB00113. 
190125|gb|AAA60126.1| 



545176|gb|AAB29813.1| retinoic acid receptor alpha, RAR alph. 
2119682 j pir I | 150674 retinoic acid receptor alpha isoform 2-1. 
1743345|emb|CAA71177.1| (Y10094) retinoic acid receptor-alph . 
133484 |sp I P11416|RRA_M0USE RETINOIC ACID RECEPTOR ALPHA (RAR. 



Score 
918 
830 
830 
830 
829 
829 
827 
827 
827 
827 
824 
819 
816 
815 



0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 



! 



EST: 

gi I 847367 |gb|R73335. 1 IR73335 yll0b08.rl Soares breast 2NbHBst H. 
gi|2714987|gb|AA705069.1|AA705069 zj83ell.sl Soares_f etal_liver . 
gil 8636851 | gb | BE174 125 . 1 IBE174125 QVl-HT0572-200300-117-cl2 HT0 . 
gi|2841314|gb|AA781983.1|AA781983 ai78gl2.sl Soares_testis_NHT . 
gi|611497|gb|T29399.1|T29399 EST79267 Human Placenta Homo sapie. 
gi|1969924|gb|AA317545.1|AA317545 EST19547 Retina II Homo sapie. 

EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 
gi|847367|gb|R73335.1 
gi (2714 987 | gb I AA705069.1 
gi | 8636851 | gb | BE174 125 . 1 
gil 2841314 | gb I AA781983 . 1 
gi|611497|gb|T29399.1 
gil 1969924 | gb I AA317545 . 1 



Human Breast 

Human fetal liver-spleen 

Human head-neck 

Human testis 

Human placenta 

Human retina 



708 
517 
466 
297 
123 
80 



0.0 

e-144 

e-128 

5e-78 

2e-25 

2e-12 



Expression information from PCR-based tissue screening panels 

Human Brain 

Human Placenta 

Human Liver 

Human Hela cells 

Human Kidney 
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1 MYESVEVGGP TPNPFLVVDF YNQNRACLLP EKGLPAPGPY STPLRTPLWN 

51 GSNHSIETQS SSSEEIVPSP PSPPPLPRIY KPCFVCQDKS SGYHYGVSAC 

101 EGCKGFFRRS IQKNMVYTCH RDKNCIINKV TRNPCQYCRL QKCFEVGMSK 

151 ESVRNDRNKK KKEVPKPECS ESYTLTPEVG ELIEKVRKAH QETFPALCQL 

201 GKYTTNNSSE QRVSLDIDLW DKFSELSTKC IIKTVEFAKQ LPGFTTLTIA 

251 DQITLLKAAC LDILILRICT RYTPEQDTMT FSDGLTLNRT QMHNAG FGPL 

301 TDLVFAFANQ LLPLEMDDAE TGLLSAICLI CGDRQDLEQP DRVDMLQEPL 

351 LEALKVYVRK RRPSRPHMFP KMLMKITDLR SISAKGAERV ITLKMEIPGS 

401 MPPLIQEMLE NSEGLDTLSG QPGGGGRDGG GLPPPPGSCS PSLSPSSNRS 

451 SPATHSP 



FEATURES : 

Functional domains and key regions : 







Results of 


Results of 


Results of 


Results of ProfileScan 




interPro 


FPrintScan 


HMMPfam 


PPsearch 


aeainst PROSITE 






against PRINTS 


against PFAM-A 


against PROSITE 


profiles 




IPR000003 


PR00545 










Retinoic acid receptor 


[149-163] 
[340-357] 
[361-380] 










IPR000324 


PR00350 








: s 


Vitamin D receptor 


[83-99] 
[100-119] 








■stss* 


1PR000536 




PF00104 






En 


Ligand-binding domain of nuclear 
hormone receptor 




[225-383] 








IPR001628 


PR00047 


PF00I05 


PS0003 1 




3 ?=? 

rs 1 


C4-type steroid receptor zinc finger 


[83-99] 
[99-114] 
[132-140] 
[140-148] 


[81-156] 


[83-109] 






IPR001723 


PR00398 








l i 


Steroid hormone receptor 


[144-154] 
[226-247] 
[247-263] 
[314-329] 
[371-3881 









Membrane spanning structure and domains: 
Helix Begin End Score Certainty 
1 292 312 0.641 Putative 



FIGURE 2, page 1 of 3 



BLAST Alignment to Top Hit: 

>gi|3213188lgb|AAC23439.1| (U15211) retinoic acid receptor alpha 2 
isoform [Rattus norvegicus] 
Length = 459 

Score = 918 bits (2347), Expect =0.0 

Identities = 445/457 (97%), Positives = 449/457 (97%), Gaps = 2/457 (0%) 
Frame = +1 



Query: 


1 


MYESVEVGG--PTPNPFLWDFYNQNRACLLPEKGLPAPGPYSTPLRTPLWNGSNHSIET 


174 






MYESVEVGG P PNPFLWDFYNQNRACLL EKGLPAPGPYSTPLRTPLWNGSNHSIET 




Sbjct : 


1 


MYESVE VGGLT PAPN PFLWDFYNQNRACLLQEKGLPAPG PYST PLRT PLWNGSNHS I ET 


60 


Query: 


175 


QSSSSEEIVPSPPSPPPLPRIYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKNMVYT 


354 






QSSSSEEIVPSPPSPPPLPRIYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKNMVYT 




Sbjct: 


61 


QSSSSEEIVPSPPSPPPLPRIYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKNMVYT 


120 


Query: 


355 


CHRDKNCIINKVTRNPCQYCRLQKCFEVGMSKESVRNDRNKKKKEVPKPECSESYTLTPE 


534 






CHRDKNCIINKVTRN CQYCRLQKCFEVGMSKESVRNDRNKKKKE PKPECSESYTLTPE 




Sbjct: 


121 


CHRDKNCIINKVTRNRCQYCRLQKCFEVGMSKESVRNDRNKKKKETPKPECSESYTLTPE 


180 


Query: 


535 


VGELIEKVRKAHQETFPALCQLGKYTTNNSSEQRVSLDIDLWDKFSELSTKCIIKTVEFA 


714 






VGELIEKVRKA+QETFPALCQLGKYTTNNSSEQRVSLDIDLWDKFSELSTKCIIKTVEFA 




Sbjct: 


181 


VGELIEKVRKANQETFPALCQLGKYTTNNSSEQRVSLDIDLWDKFSELSTKCIIKTVEFA 


240 


Query: 


715 


KQLPGFTTLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFG 


894 






KQLPG FTTLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFG 




Sbjct: 


241 


KQLPG FTTLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFG 


300 


Query: 


895 


PLTDLVFAFANQLLPLEMDDAETGLLSAICLICGDRQDLEQPDRVDMLQEPLLEALKVYV 


1074 






PLTDLVFAFANQLLPLEMDDAETGLLSAICLICGDRQDLEQPD+VDMLQEPLLEALKVYV 




Sbjct: 


301 


PLTDLVFAFANQLLPLEMDDAETGLLSAICLICGDRQDLEQPDKVDMLQEPLLEALKVYV 


360 



Query: 1075 RKRRPSRPHMFPKMLMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLENSEGLDTL 1254 

RKRRPS+PHMFPKMLMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLENSEGLDTL 
Sbjct : 361 RKRRPSQPHMFPKMLMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLENSEGLDTL 420 



Query: 1255 SGQPGGGGRDGGGLPPPPGSCSPSLSPSSNRSSPATHSP 1371 

SGQ GGG RDGGGL PPPGSCSPSLSPSS+RSSPAT SP 
Sbjct: 421 SGQSGGGTRDGGGLAPPPGSCSPSLSPSSHRSSPATQSP 459 



FIGURE 




Hnmter results : 



Model 


Description 


Score 


E- 


-value 


N 


CE00550 


CE00550 retinoic_acid_receptor 


1204 


3 






0 


1 


CE00342 


E0034 2 retinoid_X_receptor 


341 


6 


6. 


8e- 


99 


2 


PF00104 


Ligand-binding domain of nuclear hormone rec 


241 


4 


1. 


3e- 


68 


1 


PF00105 


Zinc finger, C4 type (two domains) 


192 


6 


1. 


2e- 


56 


2 


CE00055 


CE00055 peroxisome prolif erator_activated_re 


132 


1 


4 . 


9e- 


45 


3 


CE00059 


CE00059 pregnane_steroid_receptor 


126 


1 


4 . 


3e- 


37 


2 


CE00544 


CE00544 estrogen_receptor 


94 


9 


1. 


9e- 


27 


2 


CE00547 


CE00547 steroid_receptor_Ad4BP 


89 


7 


1. 


2e- 


26 


1 


CE00543 


CE00543 steroid_receptor_N10 


85 


2 


2. 


3e- 


36 


2 


CE00036 


CE00036 androstane receptor 


69 


3 




5e- 


25 


2 


CE00208 


CE00208 Progesterone_receptors 


54 


8 


3. 


8e- 


15 


2 


CE00339 


E00339 vitamin_D_receptor 


43 


7 


3. 


4e- 


12 


2 


CE00546 


CE0054 6 glucocorticoid receptor 


40 


1 


1 . 


3e- 


10 


2 


CE00545 


CE0054 5 progesteron_receptor 


36 


9 


3. 


4e- 


11 


2 


CE00386 


E00386 mineralocorticoid_receptor 


10 


2 




0. 


31 


1 


PF00907 


T-box 


3 


8 






4 


1 


CE00341 


E00341 seven-up_receptor 


3 


4 






3 


1 



Parsed for domains: 





Model 


Domain 


seq-f 


seq-t 


hmm-f 


hmm-t 


score 


E- 


-value 




CE00341 


1/1 


82 


97 


175 


190 . . 


3 


4 




3 




CE00545 


1/2 


81 


148 


580 


647 . . 


25 


6 


8 


9e-08 




CE00546 


1/2 


81 


148 


438 


514 . . 


26 


8 


5 


9e-07 




CE00208 


1/2 


81 


148 


568 


635 . . 


47 


1 


5 


6e-13 




CE00386 


1/1 


88 


149 


1 


66 [. 


10 


2 




0.31 




CE00055 


1/3 


83 


149 


109 


174 . . 


91 


7 


3 


4e-31 




CE00036 


1/2 


83 


150 


21 


88 . . 


53 


5 


2 


7e-19 




CE00339 


1/2 


83 


151 


32 


100 . . 


43 


2 


4 


8e-12 




CE00543 


1/2 


83 


156 


327 


400 . . 


75 


4 


2 


7e-32 


za : 


CE00059 


1/2 


83 


156 


39 


113 . . 


73 


5 




2e-21 




PF00105 


1/2 


81 


156 


1 


77 [ ] 


190 


7 


4 


3e-56 




CE00544 


1/2 


83 


157 


189 


263 . . 


75 


4 


9 


8e-22 




CE00547 


1/1 


83 


157 


13 


87 . . 


89 


7 


1 


2e-26 




CE00342 


1/2 


83 


200 


1 


122 [. 


158 


3 




2e-45 




PF00907 


1/1 


212 


228 


1 


18 [. 


3 


8 




4 




CE00545 


2/2 


235 


255 


745 


7 65 . . 


3 


3 




0.6 




CE00546 


2/2 


231 


256 


599 


624 . . 


2 


6 




3.1 




CE00055 


2/3 


236 


257 


297 


318 . . 


14 


7 




9e-05 




CE00544 


2/2 


238 


264 


372 


398 . . 


9 


5 




0.027 




CE00036 


2/2 


237 


264 


185 


212 . . 


12 


5 


0 


00017 




CE00208 


2/2 


226 


266 


724 


764 . . 


3 


8 




1 




CE00543 


2/2 


242 


291 


494 


543 . . 


0 


5 




0.43 




PF00105 


2/2 


326 


334 


1 


9 [. 


5 


4 




2.7 




CE00055 


3/3 


314 


380 


376 


442 . . 


15 


4 


5 


,3e-05 




PF00104 


1/1 


225 


383 


1 


167 [] 


241 


.4 


1 


3e-68 




CE00059 


2/2 


235 


414 


253 


432 .] 


47 


4 


1 


2e-13 




CE00342 


2/2 


223 


416 


119 


313 .] 


176 


7 


8 


7e-51 




CE00339 


2/2 


323 


417 


348 


443 .) 


-0 


.7 




31 




CE00550 


1/1 


55 


457 


. ] 97 


509 .] 


1204 


.3 




0 
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5 



i 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 



GTCCTTGGGT 
AATAGGATAC 
CAAGCTAGGC 
TGAAGTAGTC 
TGGACAGTGC 
GGGCGGGGCT 
CCTCTGCCTG 
GGCGGCCGGA 
TCATTCCGAC 
CACTTGGCCC 
GTCTCCGGGG 
GCCCCGCGGG 
GGTCACTCGG 
GCATTGCATT 
GGGAGGACAC 
CCACGGCTTC 
GCCGGAACTG 
CCTCGCAGCC 
TACTCCTCAT 
CCTTCCCAGG 
GGTTCCCTGC 
GCCAGGGGCG 
CGGAGGGGAG 
C AC C AG AG AC 
ACCGCGCGGG 
TGGAGAAGCC 
CAGCCCCGCG 
ACTCTGCCTC 
AGCCGCTGAG 
CCCCCCCTCC 
AGCGGCCGGC 
GCCCGGGTCA 
CTGCTCGGAT 
CCCAGCTCGG 
AGTGGGGGGT 
AGAACCGGGC 
TACTCCACCC 
GTACCGGCCT 
CTTCCACTAC 
TCTCTACCTT 
CCCGGCCGCA 
AGGGCTGAGG 
TCTCTTTGTT 
GCCCCCAGTT 
TCACCAGCAA 
TTGTGGTGGG 
NNNNNNNNNN 
ACACTCAGGA 
CAGCTGTGAG 
TATTGTTAGA 
CTGGAGGGGA 
GCCTCCCTCC 
CCCTGGCAGA 
TTGCTGGGAG 
CCCAGAGCCA 
TTCCTCTAGG 
ATCACCCTTG 
CTTTTTCTCA 
ATAGTTGGAG 
CTTCTCCAGA 
CCCCTGTCTC 
AACTGGAGAG 
GAGCCCTTCC 



AGCATGTACA 
CCCCTCCTCC 
CTTTCTTCCA 
CACTGGAAGG 
ACCAGGGGCC 
TTGCTCGCCG 
TGTTTGTGCC 
GTCACACATG 
ACAGCGTCCG 
CGCGCGACCC 
GAGGTGGCCC 
CGGGCTGGCG 
AGGTGAGGCG 
AGGCAAATGA 
CGGGACCACC 
GCTCGGCCAG 
GTACAAGGGG 
CCCTCCTCTC 
CTGGAGCCTT 
CTGCCCCCAC 
GTTTCTCCCG 
GTGCCTGGAC 
GGGGCGCGCA 
CGAGCGAGTC 
ATCCCCACCC 
CCGGATCCCC 
CTGATCCCGC 
GGACGCCACG 
TTCCCGGGCC 
CAGCACACAC 
GGACTTAGAC 
CCAGTCGGGG 
GGCGCCGCCG 
ACCTCTTGCC 
CCCACCCCTA 
CTGTTTGCTC 
CGCTCCGGAC 
CTCAGTCTGC 
TACTCGGGGG 
TCACTTAACC 
CCCTCCCCCC 
AACTTTGAGA 
TCACTTCTTC 
TGTATATTCT 
GGCTTGGGAA 
TTTTTGGTCT 
NNNNNNNNNN 
GCTCGGAGCA 
GCAGTGTACA 
TGGCTCAGGT 
AGCTCTTCCG 
TGCCTGCCAG 
TTGGAAAAGG 
AAGAGAGACC 
CTTCTATCCC 
GCCGAATCTG 
ACTTTGGAAG 
TTCTCCCAGT 
CAGGGCTGCC 
GGGCTGGAGA 
TTCTTAGCCC 
GGAGGAGGAT 
CCTAAACCAC 



TTTCCATCCC 
AGGGGTATCC 
GTGAAACGTG 
CACCAGCTCT 
GGTACTGGTT 
GAAGCACGCA 
AACAGCACCC 
ATGTCACAGA 
AGCTGCACAA 
GGCCCTACGC 
GGTTCGGCCG 
AGCGGGTGAT 
CCGCCAGGCG 
GGCCCGGCCT 
CCCCTCTTCC 
GGACTGACCA 
AGGACGCCCG 
CCTGTACTCG 
TCCCCCTTCC 
TTGCCTGTCC 
CTGCAGCCGG 
GGGGCGGGGC 
GAGCTGGGGT 
GCCAGCTGCC 
CCACCCGGAA 
GGCTGGGAGG 
CCCCGGCCTG 
AGACTCTAGA 
CCGCCAGGCT 
AACTTCCCTG 
GCGGGGACTT 
CGAGGGGACG 
GCTGAGTGAC 
TTCGAGGGGA 
ATCCCTTCCT 
CCAGAGAAGG 
TCCGCTTTGG 
TGTTGTAGGG 
TGAGAGTCCC 
CGTGTTGCCC 
AGTAACCCTA 
GTTCCTTCTT 
CCAGGAGAAA 
TTCTTGGCCT 
GGGGTGGGGG 
TGCTTACCCG 
NNNNNNNNNN 
CCAGGGTGTA 
CTGGGTGGGC 
TTCTTCCCCT 
TGGAGGATCT 
GGAGGAGGGG 
TTGAAGGGCA 
TGAGATGGAC 
AGCTTTTCCT 
CTGTGTGGCT 
AAGAGAGAGT 
GCTGGAGCAA 
CAGAGTCACC 
GAAGCTGGGA 
ATGGGGACAG 
CACAGAGAGT 
TGGACATGGG 



TTCCTTTTAT 
CCTCTTTCTA 
CATCCCGAGG 
TCCTTTTATC 
CCCCAGCTAG 
GAGCGTGGGG 
GCGCTGCCGC 
CAATGACACA 
TGTCACACCC 
CTCCTGCCGC 
GGCAGGGGGC 
GTCACGGGCA 
AGTTCAGCGA 
GGGTGGGGGT 
CCGCCCCACC 
AACCTTGGGG 
CCCCTCTTCC 
GCGTCCCTCT 
TGCTTCTCTC 
ACATGCCGCC 
ACGCGCCGGG 
GGTGGAAAGG 
GGGGGGGCCG 
CCTGGCCTGG 
TCCTCGCCAC 
AGGAAGTGCT 
CGGACTTGGG 
CGGGAGTCCC 
TCCCTGGGAG 
CTTTTCACCG 
CAGGGCAGGG 
TCTCCTCTCC 
GGGGGCGGCG 
AAGATGTACG 
AGTGGTGGAT 
GGCTCCCCGC 
AATGGCTCAA 
GGTGGGAGTG 
GGGGTGTAGT 
TTGCTGGACA 
AGTGCAATTT 
TTCAAGCAAC 
TGAAGCCCAA 
TGGGAAATCC 
GGTAAAAGGG 
GGGGGGNNNN 
NNNNNNGAAG 
CACCTGGGCA 
GGGAGCAGGC 
CCTGGGCTTT 
CCCACCTTCC 
TGGAGTGGGT 
AAGGACTTAC 
AGACAGCCCA 
ATTGTCCTGC 
GTAGACACAA 
GAGAGGATGA 
GACCCCCCTC 
CCTTCCACTG 
TCTGAGACCT 
CTCAGCTCTT 
AGGACAGGCA 
GAAGTGGAGA 



ATATGGGGGT 
GGGACCTACC 
GCTTCTAGGA 
TCTCCAGAGC 
GAGACACCTT 
AGGAGGGCCC 
GTCGGGTTCC 
AGCCGGTGTC 
GGGTGCCAAA 
CGCTCTCCGC 
TGGCGGGCGA 
GCGGTGGGTG 
GAGTTCAGCC 
GTGTGTTAAG 
ACCTCCTCCA 
GAGCCTGGGA 
GTCCTTGTCC 
GTACTCTGTG 
CTCTCCTCCC 
TCTCCCTCTC 
AATGGGTTAA 
GGGTGGTGCC 
TGGCGCGTAC 
CGGGGGCGGA 
GGAGAATCCC 
CGTTGACCCC 
GAGCCGCTGT 
CTCGAGGTGA 
AGCCGACGGA 
GGACTGGCGG 
GGCGCCCCCT 
CCCAGCTGCT 
CGCAGGACTT 
AGAGTGTAGA 
TTTTATAACC 
CCCGGGTCCG 
ACCACTGTAC 
GGCGGTAGGG 
GGAGGTCCTG 
ATTGAACCCT 
GTGTTAGATT 
ATTCCTTTCA 
GCCCCCTTTG 
CAAAAAGGTT 
TTCCCTGGTC 
NNNNNNNNNN 
GGCTCTGTGC 
TTTTCCTGCG 
GCAAGGGGGT 
GGGCTCTTTG 
TGGACCTGCT 
CTCGGGGGGG 
CCCACCCCTC 
CCTCTGCCCT 
CCCCGACCAT 
GAGGGAAGGT 
CTCTAGGACC 
CCCTAGGGGG 
CCTTGGCCAC 
TGGTCTCCAG 
CCTGGCCCAG 
GTGTATTGGT 
CCTGTCCCCA 
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3151 CATC CAT TCT GGGGTGGGGT AGTAGACCTA GAGGCCTGGG TTTCCAGTTC 

3201 CCGTAGTCTG AGCGTGGGTG TGCATATATA AGTGAGTGAG GTGTCAGTGG 

3251 ACTCGGGTCC TGAGGCTGTG AGGTTGGGAG TGATGGGGGT CTGGGGGCTT 

3301 GCCTTGAGGC ACAGGAAGGA CCCGGAGTCT GAGGGTGGCA ACTAGACTCA 

3351 GTCTAGAATA TGTGGGGCCA ATGCCACCAC CTTGGAAGGG TCCCCTTGGG 

3401 TGTGTTGGAA GTCCGCTGGT GACTGGAGCT GCCTCCAGCC CCCTCTTGGG 

34 51 GAATTCTCCA CTCTCCCCTT TACTGCCACT GAAGGTGGGA AGAGCAGGTT 

3501 GGCTCTGGGA GGAGGTGGCC TGGGTTCTGC AGGGCCCTAG GGACATTGCC 

3551 TCCCTCCCCA GAGCCCTCAT TTCGGTGCAT TAGAGGACAA GGGGGGGTGC 

3601 ACAGGATGTG GCTCCCCATC TGTCTCCCAC CAATCTCCGC CACTCACACC 

3651 TCCGCCCGCT CCCAGACGTC CAAGAATGTG AAGCACGTGG ATGCCCGTAG 

3701 TTGGGGGAGG GGGAGACGCT TATCAGGCGG CCGCTGGGCT AGGGGCCTTC 

3751 TTCCGCTGCC GCGGTACACC CAGAGCTACC CCCGCCTCTC CCCGGGAGGA 

3801 GGAAGGACGG TACAGAGGGC CCTACGCCCC CTCCCCAACC ATCCCCAGGG 

3851 GCTGCGAGGG GAGCTGCGGA GGAGCGGGCG CCAGCTGGAT TGGGAGGGGA 

3901 GCCGCTGGCC GGGGGCCCGG CTGATTTCCT GCTGATCTCC TCCAGGAAAC 

3951 CGGCCCCTTG TGCGAGCCTG CGAACGGCTC GGGGGCGTGG GGAATCCGGA 

4001 GTGGAGCGCT CTGCGCCGCC CGCCCTGCCA GGATGGGGAG CGAGGGAGGG 

4 051 GCACCCTGGC AGCGTCGGCG GGAGGGGACG CCTGGCTTCC TGGGTCAGTT 

4101 CCAGTCCTCT GTTGGGCGCT GGAACTTTGA GCTGAGAAGG TGTGGTCCTT 

4151 CTCTAGCCCG AGTCCTTCTG CAGGAAGAGG AGAGATTGGT GGGCTGGGCC 

4201 TCTGGGGAGG GAGGTTAGCA GGGATGGGCC AGGCCCGGGC AGTCCCTCCC 

4251 CCGTTGGTGT CCCTCCCCAC TCCACCTGTG TGTGCAGGGA GTTATGGCCG 

4 301 TGTCCTAACT CTTGCAGAGG CTGTGAGGAT TCCGGAGTTC CCCACACCTC 

4 351 CGGCCTTGGT CCTTGTACCT CACCTCCTTG GACTGCTGGC TGGAGGCCTG 

4 401 GGGAGGTGGG GCATCGAGCT CTGGGTTCAA AGGGCAGAGC AGGGAAACCT 

4 4 51 CAGAGCTGGG TTACCTGGGT GACAGGTGGG GATGTGCTGG AGGTAGGGGG 

4 501 CAGGCTATGT TACAGCCTCC AAGGCAGTCA AGCTGCCGTT GGGTGGGCTA 

4551 AAAGGAGGCC TTGCCCAGCC TAAACTGTAG TCCTTGCCTC TGGTCATCTC 

4 601 TCCCATTCTG CCAAAAAATA ATTTTAAAAA GCACATTCTC TCAGTTCCGT 

4 651 AAACACCCTC TGTTGGACTT TGCTTTAGCT CCATGTTTTT ATGGCTTTTT 

4701 GCCCTCTAGT CTGTCCCAGG CCTTAGAGCT GTTTACCTCT CATCCTGGTA 

4 751 TCCCCCATGA CTCCCCATAC CCTAGCTCCC CTCGTGACAT CCCGCTCTGT 

4801 ACCCCCAAAG CTCCCTCAGT CCTTTCTCCC TCTCCAGTCT GGTTCATTTT 

4851 AGAAGTGGGG CCTTGGGAGA GGCGGGGCCC AGGGCAAACG GTGGATTAGG 

4 901 AGGGGTGGGG AGGTCAGTGC CTTCTTCCTC TGCTTGTCGG AATGCTGACC 

4 951 AAGATTCTAG GCCATGGTCC CCCCAACCCT CCACATACCC CCTTGCCCTT 

5001 GATCTCCCCT CCCCCCACCA GTCTGGATTG TCTATTGTTA CTGCTTTTAC 

5051 GTCTTGGAAA AAGTTAGCAC AACAAAGGGC TGCTTTGTGG CTCACCCCCT 

5101 CTGCCTCCTG GCCTCACCCA GGCCCCCCAA CCCCGCCCCC CCAGCAGCTG 

5151 TTCTCAGGCC TCTCAGCCTG TCTGATTTGC TTGTCTGGCC TGGGGAGAAT 

5201 GAGGTGGGAG AAAACCAGGC CAGGGCAGTT GGTGTTGGAG TGAAGAGCAG 

5251 ACGGCGGTGG GGAGGTCAGG AGAGAATCTG CTGGGCTGGG GATGGTGTGG 

5301 GCATCAACTG TCCCATTGCT GCAGGCTGGT CTTGGGGCAG GGAAGGGGAT 

5351 GGGGGGCCAT AGCAGTGCTG GTCAGCCAGG CTGGCCTGGG AAGTGGTGCC 

54 01 CAGGCACTAC TAAGAGCCAG GAAAGCCCTG CCAAGGTTGT TGGCCTAGTT 

54 51 CCCTGTCATC AGCCGCCTAG CAGCCCCCAC TGTGTCTGCA GGTAAGGGGG 

5501 GAGGGTGGTA GCACATAGTC AGCCCCTGGT GTTCCCATGC TTCCTTCCTC 

5551 TGTGCCCCAA TTTTAGGGCC ATGTGATTTG GGGCTATGTG ACTCATGTCT 

5601 GTAAGGTGCT TGGGCCAGGA GCTGTGGGCA CCTTTAAATG CCAGCCAGTC 

5651 TCATGTGCCG GAGTTTGGGG TAGGGCTAGG TAGGATTGTG GAATATGGGA 

5701 GGAGGCAGGG ATCTGTCTAC CTAGGGAGGC ATCCTCATCC ATCCTTGGCC 

5751 CTGGACAAGA GAACTTGAAC GTTGGTAGGG GCCTCAGGAC GATGCTGCGT 

5801 GGCCCCTTGG GAATCTGGGA TTGTCCTGGT CATAGTTCTT ATCTTGCACC 

5851 CAACACCCTT AGCTGCCCAG GCTTTGGACA TGGATAGCCC CTACCCAACC 

5901 CAGCCCTGTT CTGCCTACAG TGATGGGCAT GGAGCCAGAC ACTGGGGAGG 

5951 ATTTGGCCAG TGAGGGCTGC CCCTGCTGTC TGGGTCACCC CTCCTGGCTG 

6001 CCCTCTTGGA GCTGAATAAC AGAAGGGGAG GGGTTAGTAA CCCGGACATA 

6051 GTATTGAGGC CAGACAGACA GAGCATTGAT GGGAACAGAC CCCCTTTGTC 

6101 ATGCCATCTC TCCCCAGATG GGGGGTACCC AGAATAATGG GCTTTTGGGG 

6151 CCCTGGGGAC TCTTCTCCCT GTATTCAGGG TATCTCCCCC TATCTCAGGG 

6201 AGACACCTCC TACTGTGCCC AGCATTTGTG ACTCTTCTTT GCACCCCCTG 

6251 CCTTGGGTCC CTGGCCCTGG GATTGTTTGG GTGGAGGAGG GGCAGTGGCT 
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GCTGGCAGAA 
TGGCCGGCTT 
ATGTCCTCTT 
GGGGAGAACG 
GACCCCCACC 
ATGGTTTATC 
CCAGAGGAGT 
AGTCTGGCTG 
TGGTGAGGCT 
TTGGTACTAA 
CTCTCTAGCC 
GCCCTCCCTC 
TGTCAGGACA 
CTGCAAGGTG 
ATGGGGATGT 
CCCTGAATTG 
ACACCTACCA 
CCCTCAAACC 
CCTCCAGCTG 
CATGGTGAGC 
CCCCCGGCGG 
TTAGGAGCAG 
GGCTCCCCAG 
GTGCAGGTAT 
TCCCTGGGGC 
TCAGAATCTT 
TGGAGTAGCT 
GACTCAGGCT 
TTCCTCTGTC 
CAGCCTGGGT 
CTCACATCTT 
TTCTCTGGCC 
CTTTTCCTGT 
TGCCCAGTTA 
GGTCTTTTCC 
AGACTCCTGC 
GGAGGATCCT 
GGTAGGCGAT 
CCCTAGACTG 
ACCTGACTCC 
GAAGAACATG 
AGGTGACCCG 
GTGGGCATGT 
CCTCAGTTGG 
TGGATGTTTG 
CGGTTGAGGA 
CTGGCTGTGT 
ACCATTCCTG 
TGTGTGCATG 
GTGGGGCTCA 
GGGGCTTGAA 
GGTGGAAGGG 
GGATTCAGAG 
CGGGGTTTCC 
AAGGCTGGAC 
TGACCGGTGA 
TTGTCTGCGT 
GGGAGGGGGC 
GGGGGCACGG 
GAATGGAGGG 
CCTGAGCCCA 
AGACCCACAA 
CTGTTTGTGC 



TGGGGTGGAG 
TGAATATCCT 
CCCTCCCTCC 
GGGCCAGGCC 
CCTACTCCCC 
ATTGTATCTT 
GATGGGGTGC 
GGGAGTCCCA 
GGGAGAGGCT 
GGATGGCGAC 
ATTGAGACCC 
GCCACCCCCT 
AGTCCTCAGG 
AGTTGAAGGG 
CCCCACTTCT 
CTGCTCTTCT 
CAGTTTCCCC 
AGAGGTCCCC 
GCAGGGCGTA 
TGGCTGCCGA 
CCCTGCCTGA 
GCACCCTGCC 
TGTCTGGCTG 
ACCTGGTCCT 
CTGAGCCTCT 
CACAGCTTTG 
ATCCCCAGAT 
GTGTGTCTGC 
CGCCTGTCCC 
CTGTGTCTGT 
TCTGCCTTGC 
AGCCCGTGTA 
GTTCTGAGTT 
GCCCCATGTC 
CTGGAAGGGA 
CCATCCTGCA 
TTTAGGTGAA 
GGGCAAACGC 
AGACCGTAGC 
CTCCCCTCCA 
GTGTACACGT 
GAACCGCTGC 
CCAAGGAGTG 
GGTCTCAGAT 
TGCACATGCA 
TGGTTTGTGT 
GTCCACGGGC 
TTTCTGCACG 
CTCCAGGATG 
GAGGCATAGG 
TTCTGAGGGG 
CTGGCACAAG 
TTGGAAGAAA 
ACTTTGGGCT 
AGGGAGACCC 
TTGATGATGT 
GGCCGTTGCC 
TGCAGGACCC 
GTGAGATGGT 
GACTGGAGTG 
GGAAGTGGTG 
GCCTGGAACC 
CACCTTCTGC 



GGGGGAGCGG 
GTTGACCCCA 
TCTTCAAGCG 
GCCCAGGGGC 
AAGCACAGTC 
TGTGGTTTTG 
TGGAGGCTTC 
GTTTTCTTAA 
CTTAGGAGGG 
CTAGGTCTCT 
AGAGCAGCAG 
CTACCCCGCA 
CTACCACTAT 
GTCATTGGGA 
GTGTCCTGGG 
TTCTCTGTGG 
ACAGGTCCTC 
TCCTGCCTCA 
CGCCTGCTCT 
CTGGCTCTTG 
ACCTCACCAT 
TTAGCTTCCC 
GATTTCCCCA 
TAGCAGCCCT 
GTGTGCGTTT 
GGCCTTGCAG 
GTGGGACGGA 
CTGTCCTGTC 
TCTGGTCAGC 
GATGGGTCGG 
TCCTGAGTGC 
TCTGCCTCCT 
CAGGGGTGTG 
TTCCTATTTC 
GGAGTCAGGT 
GTGTTGAGGC 
TCATTCTCCC 
TTGGGGGCAG 
CAGGCACTGC 
TACCCAGGGC 
GTCACCGGGA 
CAGTACTGCC 
TGAGTGCCAT 
GCTCCTAAAG 
TGAACACGCA 
GTAGCTGCAA 
AGGTCTGTGC 
TCTGGCTGTG 
GCTTTCTTCC 
CAGTCCCTTC 
TGCTTGGCTG 
GATCCAAAAG 
GAATTGGGAC 
CAGCTGAGGC 
TCTTGTGTTG 
CAGAGATAAA 
ATGGAGCCTG 
CCTAGCCCTT 
TCTGACTGTT 
TCCTGCCACA 
GCTCCTGCTG 
CTTCGCTTAA 
TCTTTTCATG 



AAGCAGAGGG 
GTTTCCTCTG 
TTAACTCCTT 
AGGAGCTTTA 
ACGGCACACA 
AAGGTGGGGG 
ATTGGCAGCC 
GACTTGAATC 
ACGGTGAGGC 
AACTGCCCCT 
TTCTGAAGAG 
TCTACAAGCC 
GGGGTCAGCG 
AGGACAGCTT 
AGTGTGCAGT 
AAGTTGGCAG 
CCCCATAAAT 
GCTCCTTTCC 
GCCACCGCTG 
GCTGGGGACC 
GGCAGCCTGG 
CTTCAGGTGC 
TCCTCACGTT 
GCGCCCGGCT 
CTTCCTCCAG 
CTCTGGGCTG 
GGTCAAGGGC 
TGGTTGTTCC 
CTGTATGTGG 
TGCACACCTG 
ATGTGTGTGT 
GGCCTCTTCG 
GGTTCCAGAT 
TGACTCACCA 
GTGTGCTGTG 
AGGTACTGGG 
CAGCTTTTCT 
CAGCTGGCCT 
TCCCACTGTG 
TTCTTCCGCC 
CAAGAACTGC 
GACTGCAGAA 
AGGGCAGGGG 
ACCAAGGGAG 
TGCCGTGGTG 
GGACCTGTTT 
TCCGGGACCG 
TGTGCTTGCG 
AGGCCGTGCT 
TGATTGTGAG 
GACTTATGTG 
CCATTGTCTA 
TTCTCAGATC 
CTGATGGAGG 
AATCATGGGT 
TGACGCTGAC 
AGCCTTGGGG 
TGTGGGGAGG 
GCACGAAGAG 
GGAGGCTGGG 
CAAGAGTGGG 
GGGCTGTCAC 
GCAGAAGGAC 



GGCGGGGGAG 
CCCCCAGCTT 
CCTAACTCGG 
GAATCAGGGT 
TACAAATGTG 
TCCTAGGAGT 
TCCTGCCCTG 
CTGCCAGCAG 
AGGGTGGAGC 
CCCCTCTTCT 
ATAGTGCCCA 
TTGCTTTGTC 
CCTGTGAGGG 
GATGAGGTCA 
TGGGGGGTGT 
CAAGCAGGGG 
GTGCAGGGCT 
CTGTCTCTAT 
CCCAGGTTGC 
CAGGAGGCCT 
CAGGAGGCAG 
CCGGGCTGTG 
AGGTGCCAGG 
TCTCCTCCTT 
AGATTGGGGC 
CTCTTCAGCC 
AAAGCACAAG 
TGGTCTGTTC 
AGCCCCTGGC 
TCTTGGTGAA 
TCGCCTCCAT 
GGCTTGTCTT 
CCCTGGCTGT 
GCAGCCCTGA 
GGTTGGGGGA 
ATTCTCCTGA 
GGCCTGCTCA 
GGCCCTCCTC 
GGTGTGGACA 
GCAGCATCCA 
ATCATCAACA 
GTGCTTTGAA 
CCGAGTCCCG 
CAGGGCTCTG 
TGCGGGCTCA 
GCGAGTCTGG 
TGTATGTGTA 
TATGTGTGTG 
TGGTTTTGGG 
TCTTAGGGGA 
TGTATGGGGG 
GTTAAGCCTG 
CCAGAGGAAA 
GAGGGAGGGA 
GTTGCCATGG 
AGACGCCTCC 
GATGGGATGG 
GCAGTGGGGA 
CCCCAGACAG 
GGTGCCTTGT 
TGACAACTCA 
CTCCTCCTCT 
CAGGGAGGGG 
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ACCCCTTCTC CCTCCCACCG CCAACTCCCC TTCTCCCTCC CACCGCCAAC 
TCCCCCTCTC CCGGCTGCTC TGTGCCCCGG AGCTGAGCAG CTGCCATTTC 
AATAGAATTA AAGCTTCCGA ATGATAAACG TCTTGTCACA GCTGCAATTT 
TCTCTTCCCA AATTATCCCC CCACTCTCCC TCTCCCTCTC CCTTCTCTCC 
CCTGCACTTT ATTGAATTTG CAGAATCGAC ATGAGTGATC TCCAAATTAT 
GCCAGCTACC CCCACCTCGC TACCCCCTCC CTGAGCCCCT CCCCCACCCT 
CCCTTCCTCC CGCGTCAGCA GCCACCACCA CCAGCCCTGT GAGTGATTGT 
GTGTCTGGAT AATCGGCTGG TAACGACCCC ATCGCTTCTT TAAAGCCGAG 
TGGTGTGTGC GGCTCAGCGC CCCTGGTGAT TTGTCAGCTC CCCAGCTAAT 
GGGCCAAGAG ATTCTCCCCG CCAGGTCCCC CACTCTCAGG CTGGGGAGCC 
CTACTCCCCA CTTGCCCCAG GAGCTGCTCA GAGCCAGTCC CAAGGGACCC 
CCAGGGAGAC TGCAGCTGGG AGGGCTGGGT GAGTGGAGGC GGGAGAAGGA 
CCTTCCTGGG GAAAGAGGAG GCAGAGCACC TAGGAGGGCA CCGTCGCCTG 
GAGTGTGAGC TGGAGTAGAC GCGTGGGGGA TAGCATGCGG CTGGCTATGG 
GGTGGGGTGG GGGGTGTGTG CAGGGCCACA GCTGTGCTCA TGGGGCTTCT 
GGGGCAGAAC TTGATGTGTG GGTTGGGTGG GCATGGAGGG CTGGAGTGCG 
TGGCAATGCC TTGCCTGCCC GTGAACGCGT GCTGTGTGCG CGTGCTTACA 
AGCCTGGGTG ACCTCCTCAG CAGCTGGCAG CTCTCTGTCA GGCTGGGGGT 
GGACGAGGCC CTGAGCAGCC TGCAGCTGCC CTCTTAACCC CCTCTGCCCT 
CCACAGCTGT GAGAAACGAC CGAAACAAGA AGAAGAAGGA GGTGCCCAAG 
CCCGAGTGCT CTGAGAGCTA CACGCTGACG CCGGAGGTGG GGGAGCTCAT 
TGAGAAGGTG CGCAAAGCGC ACCAGGAAAC CTTCCCTGCC CTCTGCCAGC 
TGGGCAAATA CACTACGGTA TGGCTTTCCC CCGGCCTGCA GGGTGGGATT 
TGCCCAGGGC CACAGGGCCA GGATGGGCCC CTCTCAGGCA CCCCTTCTTG 
TGCCAGGCAA GATCTCTGCG TCCTTCCCTT CCCCTCTCTT CTCCCTCCTC 
CTGCTGCCTC TTCCCAAGGA GCTCCCAGGA AGTGAAGGCT GGGTAGAGGG 
CAGGCCTGTG GGGGCTGGAG CCAGGCTGAG AAGGGGTGCC ATGGAGAAGA 
AGGCCCTCAC TCTCCCTCCT CCCCCAGAAC AACAGCTCAG AACAACGTGT 
CTCTCTGGAC ATTGACCTCT GGGACAAGTT CAGTGAACTC TCCACCAAGT 
GCATCATTAA GACTGTGGAG TTCGCCAAGC AGCTGCCCGG CTTCACCACC 
CTCACCATCG CCGACCAGAT CACCCTCCTC AAGGCTGCCT GCCTGGACAT 
CCTGGTGAGG GTCTGCACCC TGGCCCCCAG GCACTGCCCC TGTGTCCTGG 
GTAGATGTCC TTCCAGCCAG ACAGCCACCC TCCTAAATGT CTGTCTGCAA 
TCAACCTGTC CAAATGCCCA CCGCCCAAAT GTCTGCCCTT CCTCTCCCCA 
TATGTCCACC TGTCCACTCG TCTCCCTGTC CACTCAGCCA CCTAGCAGCC 
AGATGTGCAG GAGCTCACCT GTTCACCCAT ACACATATCC AGCCACCCAG 
CCATCCATCC ATTTAGCCAG TAATAAAGAT TCACGTAGGA GCCAGGTGCA 
GTGGCTCATA CCTGTAATCC CAGCACTTTG GGAGGCCGAG CGAGGCAGGA 
GGATCACTTG AGGCTGGAAG TTCAAGACCA CCCTGGGCAA CATAGTGAGA 
CCTTATTTCT GCAAAAAACT AAAAAGATTC ACCTAGGATC CTCTGGCCAG 
TGTTCGAGCT GGGTGTCAGG AACCCAGCGG TGAATGCACC ACCATCCCCT 
CTCTTGAAAA CCTTCCATGT GAGGCAAGAG ATAAGTCAAC AGAGGTTGCA 
AAACTGTGAT CAATGCTTCC TGGAGATTGG GGGAGGGCTT GTGACTGCTT 
GGGCCTGAAG GATGATGTCT CAGAGGAGGT GACATCTAGG GGTTTGTAGA 
GGGGGAGGTG AGAGGGTAGC CCTAACTCAG GAGCAGGAAG TGAAAGACTT 
GCTGCTGTGA GGCCATGCTG AGCTCAGGGG ACTGCCGGGC ACTCGGTGAG 
GTGAGCCCGA GGGTAGACTG GGCTGGAGGC TGGATGCAGG GGGTGGGGGC 
AGGAAGAGGT GGTGGGAACT GCCAAAGCCT AGGCTGGAGG GAGCACTCTC 
CTTCCTGCTG TCCCTGACAA GGGCTCGGTC CACCTGTTCC CTCTTGGTCA 
CCTCCAGGGT GGGGAACCTG GGATTTGACG AGACTGTCAT TTCTTTTTAT 
GTTTTTCTTT TTTGAGATGG AGTTTCACTC TTGTCACCCA GGCTGGAGTG 
CAGTAGTATG ATCTTGGCTC ACTGCAGCCT GCAACTGCTG CCTCCCGGGT 
TCAAGCGATT CTCCTGCCTC AGCCTCCTGA GTAGCTGGGA TTACAGGCAC 
CCGCCACCAC ACCCGGCTAA TTTTTGTATT TTTGTAGAGA CGGGGTTTCA 
CCATGTTGGC CAGGCCGGTC TCGAACTCCT GACCTCAGGT GATCCTCCCG 
CGTGAGCCGG CAGACTGTCA TTTCTCCATG GGCACCTCTG AATGTTGAGG 
CGGGTGATGG GTGGGAGGTT TAGATTGTGC TGCCTGCAGG GGCTCCCATC 
CCCATGCCGT GGATGCAGGA GGTGCCGTCT GGGTTCCTGC AACCACATTC 
AAGCCAATAC ACATTTACTG AGCGCTTGTT GTGTACCTCA TCCTGGGAGC 
TGTAGGCAGC AGCCCAGTGT TCCTTAGCTC CTAGAAATTC TAGGTCCCCT 
CTACATTCTT TGCATGTAGG CAGGATGACC TGGACCTGCA CTATCCAGTA 
CAGTAGCTGC TCACCACATG TGACTCTTTA AATTTAAATT AATTAAAATT 
AAACTCAATT CAGTTCCTCA GTTGCATTAG CCACATTTCA AGTACTCAGT 
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AGACGCATGT GGCTGGTGGC TGAGGTATGG ATGGTGCAGA CGTAGAACCT 
TTCCATCATT GTAGAAAATT CTATCAGACA GCATTGCTCC GGCCACCTGC 
CAGGTGGTCC TCCGGGAGTG CTGGTGCGGA GTGCTGGTGC CGAGTGCTCA 
GAGTGGGTTC GGGTTCAGTC CCTGAACCCA AGCATCCTCT GCACCCAGAT 
CCTGCGGATC TGCACGCGGT ACACGCCCGA GCAGGACACC ATGACCTTCT 
CGGACGGGCT GACCCTGAAC CGGACCCAGA TGCACAACGC TGGCTTCGGC 
CCCCTCACCG ACCTGGTCTT TGCCTTCGCC AACCAGCTGC TGCCCCTGGA 
GATGGATGAT GCGGAGACGG GGCTGCTCAG CGCCATCTGC CTCATCTGCG 
GAGGTGGGCA GGGGGCCTGG GTCTGGGGGC TGGGCTGGGA CGGGGGTGCA 
GCCCTGGAGT CTCTTCCAGG GAGCTCTTTC AGGCCACCTC TGTTAGGTAT 
CTCTAGAGGG CAGGGTCTGG TCTGCAACTA CACAGCAAGG GGGCCATGTG 
GGGCCTGGAC TCCTGTTCCC GATTTCTGGG CAACACCCCT TCTAGGGAGG 
TTAAGAGTGA GGGTTTGAGG GTCGGACCAA CCAGGGTCAC CTCCTGGCCG 
ATGCATGACC CTGAGCAGGT TGCTGAACTT CTCTGGGCCT CCGTTTCTGT 
ACAGTGGGGG CGGTAACGGT CTCTAGCTCA TGAAGTTGAT GGGAGGATTA 
CGGTGGTAAC AGATACTGTG CAGGTGCCCA GAGCGAGCTC CAGTGCTTGT 
TAGTTGCTAT TTTATTGTTG TGATTTCTGC CATTTCATCT GGTTTCCAGA 
ATAACAGGGG GGAGTGGGAG CCTGCCTGGG AACCCTCTCC CTGCTTGAGG 
ATGGCACTGC CCATTTGGGG TCCCATCCCA CTAACTGGGC TCAGGGAGGG 
TTTGGGGCAC CCCCTCACCC TCAGCTCCCG TTGCTCCCTT TTAAGGGCCT 
CTGTACCCTG CGGCAGCAGA GACCCCATGC CCTGCCCTGT GTGGGGAGGC 
GCCTGCGAGC TGCCCTCCTC CATGGCCTGG GCAGGCACGC CCCCCGGTGG 
CCGAGGCTGG GGGTGCAGCT GTGTTCCCAG CTGCTCAGGG GGTGGTTCTG 
CTTCCTCAGA CCGCCAGGAC CTGGAGCAGC CGGACCGGGT GGACATGCTG 
CAGGAGCCGC TGCTGGAGGC GCTAAAGGTC TACGTGCGGA AGCGGAGGCC 
CAGCCGCCCC CACATGTTCC CCAAGATGCT AATGAAGATT ACTGACCTGC 
GAAGCATCAG CGCCAAGGGT GAGGCTCACA GACCTGGAGG GGTACCGGCC 
CCCGACACCT GGCCCAGGCC CCCACATCCA AGCCAGCACC CCATGTCTTT 
GTGCCAGGAC AATACGACAC CTGTCCCCAT CTGTGTCTAG GCTGAGGTCC 
CCTAGTGACT CCACTTTGCC GAGGTGGCCC GCCTGTGTCA CCTTTGTGTG 
GTAGTTCAGA TCGTGGCTCT GGAACCAGAC ACGTGGGTGT GTGTCCTTGT 
GTGGGTCACT CAACAGCTCC TAGCTACAGT TTCCCTTCCG AGGGCGGGGA 
TAACATTCGT GTTTACAGAG GGGTCGGGAT GATCCCTAGC ACACAGCACA 
GGGGAAGGAA GGGCTTGGCG TCTAGCCCAG GCCGGCAGTC TGGCCCTGGA 
GCCGGAGTTC GGGACCACTT TGCCCCATTG CCACCAGCCT CTGGACCTGG 
GGGCTTAAGA GAGCTGGCTC GTGTCAAAGA ACTGAATCCC AAGAAAGATG 
CTAATATCAG CAGTATTGAT CTTCCCACCT CGAGCCAGGC TTGCTGGGGC 
TGGGGGTGGG AGGGCTGGCC CAGCGTGCTG ACCTCTGCCC CCTCCTTTCC 
TGCAGGGGCT GAGCGGGTGA TCACGCTGAA GATGGAGATC CCGGGCTCCA 
TGCCGCCTCT CATCCAGGAA ATGTTGGAGA ACTCAGAGGG CCTGGACACT 
CTGAGCGGAC AGCCGGGGGG TGGGGGGGCG GGACGGTGGT GGCCTTCTGC 
AGTAAAAAGT GCCCTGATGC CACCATTGCC GTAAAAACTA ATGCCCAATT 
GTGATAAGGA GCTACCGGGG TACACACGGG GACTGGTTCA AATGGGGCAT 
CGCCGAAGCA TGTGATGCTA TGAACTTAAT CGGACTATTA TTCTGGTGGA 
TCCTCAAACC AGCATCGCAA CCTGGACACT CTTTTGCATG GTCGTTATTA 
TCTCCGGTAG ACTCCTTGCC TCCCTTTTAC ATAAAAAGGC CTCCCCCGAC 
AAAAAGGGTC AGTTCGATCC CCACTTTCGG TTCGGGAGCC TACCGTGTGC 
CAAAGGCCCT TAATCTCGAA AATATCCCAA TTACCTGATG TCGTGCGACG 
CCTAAAAATT CCCCGTGTTG CCACCACTGC TTGAAACCCC CAAGCTTGGG 
TGTTAATCCC GAATTGGGGG CCCCCCGTNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNGG GCCCCCCCGC CAGGCAGCTG 
TAGCCCCAGC CTCAGCCCCA GCTCCAACAG AAGCAGCCCG GCCACCCACT 
CCCCGTGACC GCCCACGCCA CATGGACACA GCCCTCGCCC TCCGCCCCGG 
CTTTTCTCTG CCTTTCTACC GACCATGTGA CCCCGCACCA GCCCTGCCCC 
CACCTGCCCT CCCGGGCAGT ACTGGGGACC TTCCCTGGGG GACGGGGAGG 
GAGGAGGCAG CGACTCCTTG GACAGAGGCC TGGGCCCTCA GTGGACTGCC 
TGCTCCCACA GCCTGGGCTG ACGTCAGAGG CCGAGGCCAG GAACTGAGTG 
AGGCCCCTGG TCCTGGGTCT CAGGATGGGT CCTGGGGGCC TCGTGTTCAT 
CAAGACACCC CTCTGCCCAG CTCACCACAT CTTCATCACC AGCAAACGCC 
AGGACTTGGC TCCCCCATCC TCAGAACTCA CAAGCCATTG CTCCCCAGCT 
GGGGAACCTC AACCTCCCCC CTGCCTCGGT TGGTGACAGA GGGGGTGGGA 
CAGGGGCGGG GGGTTCCCCC TGTACATACC CTGCCATACC AACCCCAGGT 
ATTAATTCTC GCTGGTTTTG TTTTTATTTT AATTTTTTTG TTTTGATTTT 
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157 51 TTTAATAAGA ATTTTCATTT TAAGCACATT TATACTGAAG GAATTTGTGC 

15801 TGTGTATTGG GGGGAGCTGG ATCCAGAGCT GGAGGGGGTG GGTCCGGGGG 

15851 AGGGAGTGGC TCGGAAGGGG CCCCCACTCT CCTTTCATGT CCCTGTGCCC 

15901 CCCAGTTCTC CTCCTCAGCC TTTTCCTCCT CAGTTTTCTC TTTAAAACTG 

15951 TGAAGTACTA ACTTTCCAAG GCCTGCCTTC CCCTCCCTCC CACTGGAGAA 

16001 GCCGCCAGCC CCTTTCTCCC TCTGCCTGAC CACTGGGTGT GGACGGTGTG 

16051 GGGCAGCCCT GAAAGGACAG GCTCCTGGCC TTGGCACTTG CCTGCACCCA 

16101 CCATGAGGCA TGGAGCAGGG CAGAGCAAGG GCCCCGGGAC AGAGTTTTCC 

16151 CAGACCTGGC TCCTCGGCAG AGCTGCCTCC CGTCAGGGCC CACATCATCT 

16201 AGGCTCCCCA GCCCCCACTG TGAAGGGGCT GGCCAGGGGC CCGAGCTGCC 

16251 CCCACCCCCG GCCTCAGCCA CCAGCACCCC CATAGGGCCC CCAGACACCA 

16301 CACACATGCG CGTGCGCACA CACACAAACA CACACACACT GGACAGTAGA 

16351 TGGGCCGACA CACACTTGGC CCGAGTTCCT CCATTTCCCT GGCCTGCCCC 

16401 CCACCCCCAA CCTGTCCCAC CCCCGTGCCC CCTCCTTACC CCGCAGGACG 

16451 GGCCTACAGG GGGGTCTCCC CTCACCCCTG CACCCCCAGC TGGGGGAGCT 

16501 GGCTCTGCCC CGACCTCCTT CACCAGGGGT TGGGGCCCCT TCCCCTGGAG 

16551 CCCGTGGGTG CACCTGTTAC TGTTGGGCTT TCCACTGAGA TCTACTGGAT 

16601 AAAGAATAAA GTTCTATTTA TTCTACACAT GCCTCCAGCC TTGCTGCCTC 

16651 CACCCCCTCC TCTTGGCGTC TGGTCTGGGG GCTTGGGATG GGTTTCGTCA 

16701 TGTGCTCTGG GCCTGTGATG GCCAGGAATG AGCACTGGGG CCAAGGGGCT 

167 51 GGCCAGGGCA CCCTTCCAAG CTGCCTTCTG AGGCTTACCT TGTGCTGGGG 

16801 TCTTTGGAGA TGCTGAGAAG GAGAAAGTCC TGCCCCTTGG GAAGCCCTCA 

16851 GTCTGGGGAT CCACACTGCC CATGTCAAGG AGCCCCAGTC TGGGAGTGGG 

16901 AGAGAAGAGG AGGAAAGCTG CCCCCACCTT CAGGGAACCC CCAGTCTGAG 

16951 GGAGGAAGCC GGAGCCACCC CTAGACATTT CTGGTCCTTG GGAAGCCTTC 
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Exon : 
Intron : 
Exon : 
Intron : 
Exon : 
Intron : 
Exon : 
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Exon : 
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Exon: 
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Start 
Exon : 



1684 

1684-1846 
1847-6808 
6809-6957 
6958-8277 
8278-8419 



Exon: 
Stop: 



8420-10406 

10407-10567 

10568-10827 

10828-11004 

11005-12798 

12799-13003 

13004-13759 

13760-13918 

13919-14505 

14506-14658 

14659-15143 

15144-15208 

15206 



Map: 

Bac Accession AC018629 
Homo sapiens chromosome 17 
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Position 


Allele 1 


Allele 2 


Context 


4084 


C 


G 


GCCTCTCCCCGGGAGGAGGAAGGACGGTACAGAGGGCCCTACGCCCCCTCCCCAACCATCCCCAGG 
GGCTGCGAGGGGAGCTGCGGAGGAGCGGGCGCCAGCTGGATTGGGAGGGGAGCCGCTGGCCGGGGG 
CCCGGCTGATTTCCTGCTGATCTCCTCCAGGAAACCGGCCCCTTGTGCGAGCCTGCGAACGGCTCG 
GGGGCGTGGGGAATCCGGAGTGGAGCGCTCTGCGCCGCCCGCCCTGCCAGGATGGGGAGCGAGGGA 
GGGGCACCCTGGCAGCGTCGGCGGGAGGGGACGCCT [C, G] 

GCTTCCTGGGTCAGTTCCAGTCCTCTGTTGGGCGCTGGAACTTTGAGCTGAGAAGGTGTGGTCCTT 
CTCTAGCCCGAGTCCTTCTGCAGGAAGAGGAGAGATTGGTGGGCTGGGCCTCTGGGGAGGGAGGTT 
AGCAGGGATGGGCCAGGCCCGGGCAGTCCCTCCCCCGTTGGTGTCCCTCCCCACTCCACCTGTGTG 
TGCAGGGAGTTATGGCCGTGTCCTAACTCTTGCAGAGGCTGTGAGGATTCCGGAGTTCCCCACACC 
TCCGGCCTTGGTCCTTGTACCTCACCTCCTTGGACT 


6482 


G 


A 


ATCTCCCCCTATCTCAGGGAGACACCTCCTACTGTGCCCAGCATTTGTGACTCTTCTTTGCACCCC 
CTGCCTTGGGTCCCTGGCCCTGGGATTGTTTGGGTGGAGGAGGGGCAGTGGCTGCTGGCAGAATGG 
GGTGGAGGGGGGAGCGGAAGCAGAGGGGGCGGGGGAGTGGCCGGCTTTGAATATCCTGTTGACCCC 
AGTTTCCTCTGCCCCCAGCTTATGTCCTCTTCCCTCCCTCCTCTTCAAGCGTTAACTCCTTCCTAA 
CTCGGGGGGAGAACGGGGCCAGGCCGCCCAGGGGCA [G, A] 

GAGCTTTAGAATCAGGGTGACCCCCACCCCTACTCCCCAAGCACAGTCACGGCACACATACAAATG 
TGATGGTTTATCATTGTATCTTTGTGGTTTTGAAGGTGGGGGTCCTAGGAGTCCAGAGGAGTGATG 
GGGTGCTGGAGGCTTCATTGGCAGCCTCCTGCCCTGAGTCTGGCTGGGGAGTCCCAGTTTTCTTAA 
GACTTGAATCCTGCCAGCAGTGGTGAGGCTGGGAGAGGCTCTTAGGAGGGACGGTGAGGCAGGGTG 
GAGCTTGGTACTAAGGATGGCGACCTAGGTCTCTAA 


8066 


C 


G 


TCTGTGATGGGTCGGTGCACACCTGTCTTGGTGAACTCACATCTTTCTGCCTTGCTCCTGAGTGCA 
TGTGTGTGTTCGCCTCCATTTCTCTGGCCAGCCCGTGTATCTGCCTCCTGGCCTCTTCGGGCTTGT 
CTTCTTTTCCTGTGTTCTGAGTTCAGGGGTGTGGGTTCCAGATCCCTGGCTGTTGCCCAGTTAGCC 
CCATGTCTTCCTATTTCTGACTCACCAGCAGCCCTGAGGTCTTTTCCCTGGAAGGGAGGAGTCAGG 
TGTGTGCTGTGGGTTGGGGGAAGACTCCTGCCCATC [C, G] 

TGCAGTGTTGAGGCAGGTACTGGGATTCTCCTGAGGAGGATCCTTTTAGGTGAATCATTCTCCCCA 
GCTTTTCTGGCCTGCTCAGGTAGGCGATGGGCAAACGCTTGGGGGCAGCAGCTGGCCTGGCCCTCC 
TCCCCTAGACTGAGACCGTAGCCAGGCACTGCTCCCACTGTGGGTGTGGACAACCTGACTCCCTCC 
CCTCCATACCCAGGGCTTCTTCCGCCGCAGCATCCAGAAGAACATGGTGTACACGTGTCACCGGGA 
CAAGAACTGCATCATCAACAAGGTGACCCGGAACCG 


8699 


T 


C 


AAGTGGGCATGTCCAAGGAGTGTGAGTGCCATAGGGCAGGGGCCGAGTCCCGCCTCAGTTGGGGTC 
TCAGATGCTCCTAAAGACCAAGGGAGCAGGGCTCTGTGGATGTTTGTGCACATGCATGAACACGCA 
TGCCGTGGTGTGCGGGCTCACGGTTGAGGATGGTTTGTGTGTAGCTGCAAGGACCTGTTTGCGAGT 
CTGGCTGGCTGTGTGTCCACGGGCAGGTCTGTGCTCCGGGACCGTGTATGTGTAACCATTCCTGTT 
TCTGCACGTCTGGCTGTGTGTGCTTGCGT ATGTGTG [ T , C ] 

GTGTGTGCATGCTCCAGGATGGCTTTCTTCCAGGCCGTGCTTGGTTTTGGGGTGGGGCTCAGAGGC 
ATAGGCAGTCCCTTCTGATTGTGAGTCTTAGGGGAGGGGCTTGAATTCTGAGGGGTGCTTGGCTGG 
ACTTATGTGTGTATGGGGGGGTGGAAGGGCTGGCACAAGGATCCAAAAGCCATTGTCTAGTTAAGC 
CTGGGATTCAGAGTTGGAAGAAAGAATTGGGACTTCTCAGATCCCAGAGGAAACGGGGTTTCCACT 
TTGGGCTCAGCTGAGGCCTGATGGAGGGAGGGAGGG 


12897 


C 


T 


CAGTAGACGCATGTGGCTGGTGGCTGAGGTATGGATGGTGCAGACGTAGAACCTTTCCATCATTGT 
AGAAAATTCTATCAGACAGCATTGCTCCGGCCACCTGCCAGGTGGTCCTCCGGGAGTGCTGGTGCG 
GAGTGCTGGTGCCGAGTGCTCAGAGTGGGTTCGGGTTCAGTCCCTGAACCCAAGCATCCTCTGCAC 
CCAGATCCTGCGGATCTGCACGCGGTACACGCCCGAGCAGGACACCATGACCTTCTCGGACGGGCT 
GACCCTGAACCGGACCCAGATGCACAACGCTGGCTT [ C , T ] 

GGCCCCCTCACCGACCTGGTCTTTGCCTTCGCCAACCAGCTGCTGCCCCTGGAGATGGATGATGCG 
GAGACGGGGCTGCTCAGCGCCATCTGCCTCATCTGCGGAGGTGGGCAGGGGGCCTGGGTCTGGGGG 
CTGGGCTGGGACGGGGGTGCAGCCCTGGAGTCTCTTCCAGGGAGCTCTTTCAGGCCACCTCTGTTA 
GGTATCTCTAGAGGGCAGGGTCTGGTCTGCAACTACACAGCAAGGGGGCCATGTGGGGCCTGGACT 
CCTGTTCCCGATTTCTGGGCAACACCCCTTCTAGGG 


14442 


C 


T 


TGTCCTTGTGTGGGTCACTCAACAGCTCCTAGCTACAGTTTCCCTTCCGAGGGCGGGGATAACATT 
CGTGTTTACAGAGGGGTCGGGATGATCCCTAGCACACAGCACAGGGGAAGGAAGGGCTTGGCGTCT 
AGCCCAGGCCGGCAGTCTGGCCCTGGAGCCGGAGTTCGGGACCACTTTGCCCCATTGCCACCAGCC 
TCTGGACCTGGGGGCTTAAGAGAGCTGGCTCGTGTCAAAGAACTGAATCCCAAGAAAGATGCTAAT 
ATCAGCAGTATTGATCTTCCCACCTCGAGCCAGGCT [C, T J 

GCTGGGGCTGGGGGTGGGAGGGCTGGCCCAGCGTGCTGACCTCTGCCCCCTCCTTTCCTGCAGGGG 
CTGAGCGGGTGATCACGCTGAAGATGGAGATCCCGGGCTCCATGCCGCCTCTCATCCAGGAAATGT 
TGGAGAACTCAGAGGGCCTGGACACTCTGAGCGGACAGCCGGGGGGTGGGGGGGCG 
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4084 
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Intron 








6482 
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Intron 
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8699 
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Exon 


237 
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